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Abstract — We show that social learning is not useful in a 
model of team binary decision making by voting, where each 
vote carries equal weight. Specifically, we consider Bayesian 
binary hypothesis testing where agents have any conditionally- 
independent observation distribution and their local decisions 
are fused by any L-out-of-iV fusion rule. The agents make local 
decisions sequentially, with each allowed to use its own private 
signal and all precedent local decisions. Though social learning 
generally occurs in that precedent local decisions affect an agent's 
belief, optimal team performance is obtained when all precedent 
local decisions are ignored. Thus, social learning is futile, and 
secret ballots are optimal. This contrasts with typical studies of 
social learning because we include a fusion center rather than 
concentrating on the performance of the latest-acting agents. 

Index Terms — Bayesian hypothesis testing, distributed detec- 
tion and fusion, sequential decision making, social learning, social 
networks 

I. Introduction 

Consider a set of agents making a decision collectively by 
voting, with each agent having equal influence. It would seem 
that the best collective decisions would come from each agent 
having as much information as possible. For example, if the 
voting is sequential rather than simultaneous, it would seem 
that the collective decision would be improved by later-acting 
agents having knowledge of the votes of the earlier-acting 
agents. Using the methods of signal processing to analyze a 
simple model, we show that this is not the case. In a social or 
political context, this provides a mathematical justification for 
the use of secret ballots, independent of any other merits, such 
as avoiding bribery, intimidation, or insincere voting. To be 
clear, though, our interest is in binary decision making under 
uncertainty, not voting to express individual preferences. 

In contemporary popular culture, the advantage of (prop- 
erly) aggregating opinions of many (reasonably diverse and 
independent) agents is known as "the wisdom of crowds" QJ. 
In statistical signal processing, this is nothing more than a 
reduction in effective noise level from averaging or aggre- 
gating. Suppose private signals observed by decision-making 
agents are not perfectly informative due to noise. If N agents 
share their private signals, assumed conditionally independent 
and identically distributed (iid) given the unknown variable of 
interest, then the sharing effectively increases the signal-to- 
noise ratio (SNR) by a factor of N. Even if a communication 
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capacity limit restricts the amount of shared information, 
sharing still helps the detection system work much better than 
a single-agent system. For example, in systems where agents 
with Gaussian noise-corrupted observations share 1-bit signals 
that are fused with the MAJORITY fusion rule, the effective 
SNR is asymptotically increased by a factor of 2N/ir |2J. 

In this paper, we consider a tearrj^] of agents that together 
make a binary decision (between and 1) by voting. Binary 
decisions often made by teams include whether to make a 
large purchase, whether to hire a job applicant, and whether 
to convict a defendant; the final example highlights that 
aggregation by voting, with equal weight to each vote, does 
not imply the use of majority rule. The agents have their own 
private signals and make local decisions in some preordained 
order. Because of this ordering, we refer to the agents as a 
sequence. The agents form a distributed detection and data 
fusion system because their local decisions are fused to reach 
a global decision, and only the global decision determines their 
performance. 

Now assume that the agents can watch other agents' choices 
as public signals so that they can use the predecessors' 
decisions in decision making. The framework of sequential 
decision making was independently introduced in |4| and |5|. 
A key concept in these works is herding. A herd of agents 
takes place when all agents beyond some index have the same 
local decision. Both works found that a herd of agents making 
the incorrect decision might arise with positive probability 
when private signals are bounded; this is a disappointing 
occurrence since the optimal aggregation of private signals 
would result in vanishing error probability. Subsequently, (6| 
showed that agents will asymptotically settle on the correct 
decision if private signals are unbounded. Recently, |7j extends 
the result to general network topologies where each agent can 
observe decisions made by its neighbors instead of all previous 
agents. 

This work contrasts from previous works in two key ways: 
we consider an arbitrary but finite number of agents N; and 
we consider any symmetric fusion of the local decisions (i.e., 
voting by an L-out-oi-N ruld^]). Because the number of agents 
is finite, presence or absence of herding is not central to our 
study; a herd of agents might arise and yet form a small 

The term team implies that the agents have no conflicts of interest, which 
here translates to agreement in the relative importance of false alarms and 
missed detections 131 - 

2 In the notation of L-ovA-of-N rule, N denotes the number of agents and 
L denotes the minimum number of 1 votes for 1 to be the global decision. 
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fraction of the N total agents. Also, the fusion by voting makes 
an early-acting agent important not only for its influence on 
later-acting agents but also because its vote counts. 

As the first to study this scenario, we ask whether it is 
beneficial for the agents not only to send their local decisions 
to the fusion center but also to share among themselves. 

The public signals raise two changes to the model: the belief 
update that is standard in social learning and a fusion rule 
update that has not appeared previously in the context of social 
learning. 

Each agent will update its belief about the true state based 
on the public signals (in a Bayesian setting) and design a better 
decision rule. The previous decisions reflecting the private 
signals of the precedent agents contain information that the 
following agents do not have. If an agent observes that most 
predecessors have chosen 1, then the agent will have a stronger 
belief on 1 and a weaker belief on 0. Hence, the update of 
belief is a positive feedback that encourages an agent to follow 
the precedents. 

Each agent will adjust the fusion rule based on the public 
signals. For example, if the first agent has chosen 0, then 
the L-out-of-A fusion rule changes to the L-out-of-(A— 1) 
rule for the remaining agents. And if the second agent has 
subsequently chosen 1, then the fusion rule becomes the 
(L— l)-out-of-(A— 2) rule, etc. Thus, the previous decisions 
put a following agent in a different position; if most previous 
agents have chosen 1, then a few more votes for 1 will 
determine the global decision as 1. This fact makes the agent 
become more careful to choose 1. The evolution of the fusion 
rule is a negative feedback that discourages an agent to follow 
the precedents. 

Our mathematical analysis shows that those positive and 
negative effects are exactly canceled out and that the optimal 
decision rules with public signals are equal to those without 
public signals. It implies that observing public signals is 
practically useless. 

Section III] describes our decision-making model formally. 
Section |IIl| provides results for two agents and Section [IV 
extends the results for an arbitrary number of agents. SectionjV 
concludes the paper. 

II. The Formal Model 

A team of A agents — Alexis, Blake, Norah — performs 
a binary hypothesis test for an object. The agents are aware 
of the prior probabilities of the object in state H = and 
in state H = 1, which are denoted by po = F{H = 0} and 
pi = ¥{H = 1} = 1 — pq. The agents individually receive 
private signals about the state and perform tests to make their 
own binary decisions They are a team in the sense that 
they place the same relative importance on false alarms and 
missed detections and share a common goal to minimize the 
cost of global decisions. The cost for a false alarm is cio and 
the cost for a missed detection is cqi. Thus, for the L-out-of- 
N fusion rule, the average cost (i.e., Bayes risk) is given as 
follows: 

R = c 10 poP{£ti ^ > L | H = 0} 

+ c iPiP{£^ 1 i? i <L-1\H = 1}. (1) 



The private signals are conditionally iid given the state 
H, with the likelihood function fy^H- In other words, all 
agents observe private signals of equal quality. Bayesian agents 
use a likelihood ratio test (LRT) as the optimal decision rule, 
which chooses Hi = 1 if the ratio fv^H^yi I 1)/ ' fv^HiVi I 0) 
is greater than a certain threshold |8j, j9). We assume that the 
likelihood ratio is an increasing function of yi so that the LRT 
is simplified to a decision rule with a decision threshold Aj: 

Hi{ yi )=l 

y l | Ai. (2) 

We will compare the optimal decision thresholds in the 
following two scenarios: 

1) A common distributed detection system: Agents only 
observe private signals and make local decisions in par- 
allel without knowing other agents' decisions. Then their 
decisions are fused by a specific L-out-of- A rule to make 
a global decision. Alternatively, we call this scenario 
parallel decision making. 

2) A distributed detection system combined with sequential 
decision making: Agents observe private signals and 
sequentially make local decisions. Before each agent 
makes a decision, the agent observes precedents. Then 
their decisions are fused by a specific L-out-of-A rule to 
make a global decision. 

Our notations for decision thresholds are distinguished in the 
two scenarios: A in the first scenario and p in the second 
scenario. 

In Scenario 1, we will say that the optimal decision thresh- 
olds are identical, i.e., A^ = • • ■ = X* N for the following 
reasons: Using identical decision thresholds is asymptotically 
optimum for the binary hypothesis testing problem |10|. 
Furthermore, by numerical experiments, it turns out that con- 
straining to identical decision rules causes little or no loss of 
performance for finite A and the corresponding optimal fusion 
rule has the L-out-of-A form 1 1 1 ]. Our numerical experiments 
(not reported here) for fixed fusion rules and conditionally iid 
private signals show that the optimal decision thresholds are 
in fact identical at least for any N < 7 in multiple cases with 
such as Gaussian and exponential likelihood functions. 

We use P\ and P* 1 as notations for probabilities of Type I 
(false alarm) and Type II (missed detection) errors. We use a 
subscript index to indicate a specific agent. For example, error 
probabilities of Alexis are written as 

P\ ! = ¥{Ht =1\H = 0}, P]\ = F0! =0\H = 1}. 

(3) 

Since all agents have equally good private signals, their error 
probabilities are the same as Q in the parallel decision- 
making scenario. 

In Scenario 2, the agents sequentially make hard decisions: 
Alexis first makes a decision Hi, Blake next makes a decision 
Hi, and so on. Predecessors' decisions, which we call public 
signals, may cause different error probabilities by following 
agents. In this case, we use superscript notation to specify 
the public signals. For example, Pj° 2 denotes the probability 
of Type I error of Blake (the second agent) upon observing 
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Fig. 1. Two agents and a fusion center, (a) Parallel decision making, (b) 
Sequential decision making. 



Hi = and Pj^ 1 denotes the probability of Type II error of 
Chuck (the third agent) upon observing H\ = and H% = 1. 
We sometimes use a notation like Pj° 3 that specifies Alexis's 
decision as Hi = but does not specify Blake's decision H 2 . 

Once the error probabilities of all agents are computed, the 
conditional probabilities of the global decision being in error 
are given from the L-out-of-A fusion rule: 

N 

^=phih(!|o)=e e u p ii n (i-^i), 

n=L '£[«] igj je[N]\I 

\I\=n 

N 

f3=pb\ B (w)= e e n 

n=AT-L+l 1C[N] igj j'eWlW 
|I|=n 

where [TV] denotes the set {1,2,..., N}. The Bayes risk is 
based on the team decision: 



R = ciqPqPe + coiPiPe- 



III. Two Agents 



(4) 



Let us consider the simplest case for distributed detection, 
which is N — 2. We will compare two cases, which are 
depicted in Fig. [T] We start from an assumption that Alexis 
does not change her strategy whether the agents observe public 
signals or not. The assumption comes from an intuition that 
she is the first agent to make a decision and does not observe 
any previous decisions in both scenarios. 

Theorem 1. Suppose that sharing previous decisions does not 
change Alexis's decision rule. Then, for N = 2 and any L- 
out-of-N fusion rule, the existence of public signals does not 
affect the decision strategies of agents. 

Proof: There are only two meaningful fusion rules when 
N = 2: l-out-of-2 (OR) rule and 2-out-of-2 (AND) rule. Let's 
consider the OR rule first. 

In the parallel decision-making scenario, the Bayes risk is 
given by 

R P = c wPo {Pis + Pl, 2 - PlxPla) + coiPiPllA (5) 

where Alexis's error probabilities 1 and P*\ are governed 
by her decision threshold Ai and Blake's P\ 2 and Pj l 2 are 



governed by his decision threshold X 2 . Their optimal decision 
thresholds A* and X 2 are the minimizer of (|5J. 

In the sequential decision-making scenario, Blake observes 
Alexis's decision Hi. He makes a decision carefully only if 
her decision is Hi — because Hi = 1 already determines 
the global decision as H = 1. Thus, we take care of Alexis's 
decision threshold, pi, and Blake's decision threshold upon 
observing Hi = 0, which is denoted by p 2 , only. The Bayes 
risk is given by 

R s = c w Po (Pli + (l- Pii) F*> ) + coipiP^Pl 1 ", (6) 



which is exactly the same formula as Q. Therefore, the 
optimal decision thresholds would be the same as those in 
the first case: 



Pi = -V 



n* 

Pi 



X* 



(7) 



and p* = p 2 * because X\ = X 2 . Since we do not care about 
Blake's decision threshold when Hi = 1, p 2 , we can set 
p 2 * = p 2 *, too. Therefore, Alexis and Blake's optimal decision 
thresholds are not affected by the existence and the value of 
the public signal Hi. 

The same statement for the AND fusion rule can also be 
proven in a similar way. ■ 



IV. N Agents 



In Section III we have shown that knowing Alexis's deci- 
sion practically does not change Blake's optimal strategies for 
N = 2 and any L-out-of-A^ fusion rule. Now let us expand 
the problem to a general A^-agent problem by mathematical 
induction. 

Theorem 2. Suppose that sharing previous decisions does not 
change Alexis's decision rule (i.e., p\ — X*). If the existence of 
the public signals does not affect optimal decision thresholds 
of a team of N agents for a specific N and any K-out-of- 
N fusion rule, then the existence of the public signals also 
does not affect optimal decision thresholds of a team of N+l 
agents and any L-out-of-(N +1) fusion rule. 

Proof: First, consider the parallel decision-making sce- 
nario with N+l agents. Since a decision of an agent is critical 
only if the other N local decisions are L—l ones and N—L+l 
zeros, the optimal decision threshold A* is the solution tcj^J 



fr\ H (X\l) _ cio Po ( L N i) (PlY 



1 - P 



Jy\h{^\0) CqiPi( 



N \( P U\N-L+1 
N-L+l) \ r e ) 



aopoipty Ui-p 



(l-^i 1 ) 
tn N-L+l 



L-l 



coiPi (PI 1 ) 



N-L + l 



(1 -PI 1 ) 



L-l ■ 



(8) 



where we embed the idea that the optimal decision thresholds 
of all agents are identical. 

Next, consider the sequential decision-making scenario 
when the agents observe previous decisions. The Bayes risk 

3 Please see [12] for a detailed description of how j8j is derived. 
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is given by 

R P = C WP0 (1 - Pl.l) P {En=2 H n >L\H 1 = H = 0} 
+ C 10 p ^lP {En=2 H n > L - 1 1 Hx = 1, H = O} 
+ coiPiP^P < I - 1 1 Hi = 0, H = l} 



CQlP! (1 - P^) P ^ 2J^" 1 

. ra=2 



^ P (p (l-P e \ 



2|JIi = H = l| 

/?, •/'!; •/':■:! /'.":>) 
(9) 



where P an d Pi are specified in (JTTJ and ( |T2] > and we define 

g°= 7 V . v ^ n =P{g = 0|ffl=0}, 

— T P ° Fe ; 1 — - T =p{ J fi = o|£i = i}. (io) 
p p f l + ft i-p" 



When the agents 2, . . . , N+l observe that Hi = 0, their 
optimal decision strategy is to minimize the term Rq from 



Ra = c w q a V {En=2 H° 



>L\H = , j 
< L 



Hi? = 1 



}, (ID 



where the condition Hi = is intended in the term H° v Please 
note that Rq is the same as the Bayes risk of N agents when 
the prior probability is q° and fusion is by the L-out-of-7V 
rule. It implies that the optimal decision thresholds of Agents 
2, ...,N+1 are the same as those of N agents with prior 
probability q<> and the L-out-of-iV fusion rule. 

Likewise, when the agents 2, . . . , iV+1 observe that Hi = 
1, their optimal decision strategy is to minimize the term Ri 
from 



Ri 



cioq 
+ coi 



1 p{EL + 2 1 ^A>i-i|^ = o} 

(1 - )P H\ < L - 2 | H = l} . (12) 



Their optimal decision thresholds are the same as those of N 
agents with prior probability q 1 and {L— l)-out-of-iV fusion 
rule. Fig. |2] depicts the evolution of the problem corresponding 
to Alexis's decision Hi. 

Let us find the optimal thresholds p^* , ■ ■ ■ , p'n+i m Pr°bl em 
BO in Fig. [2] In fact, the N agents in Problem BO also observe 
public signals. However, because of the condition that the 
existence of the public signals does not affect optimal decision 
thresholds of a team of N agents for any K-out-of-N fusion 
rule, we can find the optimal thresholds as if the agents do 
not observe the public signals. Since a decision of an agent is 
critical only if the other A^— 1 local decisions are L— 1 ones 
and N—L zeros, the optimal decision threshold p°* is the 



Hi =o 



Problem BO 

■ W agents 

■ Belief: q° 
■L/Wrule 




Optimal 
thresholds 

pi, ...,pu 



Optimal 
thresholds 

Pi, Pnvl 



Fig. 2. An (A r +l)-agent problem is divided into two Af-agent problems 
depending on Alexis's decision Hi . 



solution to 

f Y \ H (Jfi 1 1) = 

/y|ff(p ° |0) c 01 (l-go)(£- 

Ciopo (1-PJ) (P e In 



,N-L 



Pe l0 ) N L (l-Pl l °^ L 



(13) 



coiPiPl 1 (Pi 1 *)" L (l-Pl l °) 



L-l 



where q° is replaced by ( fTOj i. Due to the assumption that pi = 
A*, Pi and P e n in ([13) are the same as P\ and Pf in ((HJl. 

Comparing ( [T3| l to |8j, we can find that they have the same 
solutions, i.e., p a * = A*. Therefore, the agents should not 
change their decision thresholds after observing Hi = 0. 

We can also find the optimal thresholds pi,* , . . . , p]^ +1 in 
Problem B 1 in Fig. [2] by looking at the A-agent problem 
without public signals: 

fy\*{f 1 1) _ ci^dzl) (P^) L - 2 (l-Pt) N ~ L+1 



^(PMO) coid-q^li) (P^) 



N-L+l 



1-Pe 



Hi 



h-1 



c wPo Pl {Pi 1 



L-2 



caiPiil-Pl 1 )^)" ^(l-Pe 111 ) 



L-2 ■ 



(14) 



Again, due to the assumption that p\ = A^, P\ and Pj 1 in 
( |T4~] > are the same as P\ and P^ 1 in (|8). We reach to the same 
statement that the two equations have the same solutions, i.e., 
Pi * — ^H> by comparing ( |T4] > to ((8). Thus, the agents should 
not change their decision thresholds after observing H\ = 1, 

Consequently, for a team of A + 1 agents and any L-out- 
of-(A+l) rule, their optimal decision thresholds are the same 
whether they observe previous decisions or not. ■ 

We want to clarify the term belief in Fig. [2] which is 
distinguished from prior probability. The prior probability 
is fixed as p Q and known to all agents. The belief is how 
probable the agents think H = is. The belief can change 
as agents observe previous decisions even though the prior 
probability cannot. The above proof does not mean that the 
prior probability is changed after the agents observe Alexis's 
decision. It is saying that the problem of determining optimal 
thresholds upon observing Alexis's decision is equivalent to 
Problem B0 or Bl corresponding to Hi, 
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The essence of this proof is to show that the effect of 
new fusion rules, Lout-of-A^ and (L— l)-out-of-JV, exactly 
cancels out the effect of new beliefs q° and q 1 . It is observed 
when we compare ( fT3| l and ( fl4] > to dSJ. As we mentioned 
earlier in Section [j] the public signals seem to have the same 
amount of a positive feedback and a negative feedback. From 
an overall standpoint, the public signals neither harm nor help 
the decision-making task. 

Corollary 3. Suppose that sharing previous decisions does not 
change Alexis's decision rule (i.e., p\ — X\). For any N and 
L-out-of-N fusion rule, the existence of the public signals does 
not affect optimal decision thresholds of a team of N agents. 

Proof: By mathematical induction, it is true from Theo- 
rems Q] and 12 ■ 

Corollary [3]is trivial for N = 1 and is proven by Theorem[T] 
for N = 2. For TV > 3, we can iteratively break down the 
problem until we have 2 N ~ 2 two-agent problems like in Fig. [5] 
The backward process from the leaves (two-agent problems) 
to the root ( A^-agent problem) will demonstrate Corollary [3] 

Corollary [3] only refers to the optimal decision thresholds. 
However, it implies that team's performance (i.e. Bayes risk) 
is not affected by the public signals. This is because decision 
thresholds determine the probabilities of errors and, thus, the 
Bayes risk. 



V. Conclusion 

We have discussed sequential decision making in a dis- 
tributed detection and fusion system by a team of agents. 
It would be intuitively desired to have as much information 
as possible when performing a hypothesis test. However, our 
study has revealed that it is useless to observe other agents' 
decisions in this system. The agents just need to individually 
make the best decisions. 

This result is justified by symmetries throughout our model. 
The first symmetry is the equal quality of private signals due 
to the identical likelihood functions. The second symmetry is 
the equal (1-bit) votes that agents have. The third symmetry 
is the equal weights of the votes implied by the L-out-of-A" 
fusion rule. 

One interpretation of the main result is as follows: Each 
agent sends 1 bit of information about its private signal to 
the fusion center. If Blake refers to Alexis's decision, then the 
fusion center effectively receives less than 1 bit of information 
about Blake's private signal from him. Therefore, in order 
to prevent such an efficiency loss, Blake should make a 
reasonable decision only based on his own private signal. 

This result is obtained under an assumption that Alexis 
uses the same decision thresholds in both scenarios, and the 
assumption is not unreasonable. The assumption is trivially 
true for AT = 1. It is also true for N = 2; our proof of 
Theorem [T] even does not use this assumption. In addition, 
we have confirmed that it is true at least for any A^ < 9 by 
numerical experiments. This assumption heuristically seems 
true, and our future work is to verify it for arbitrary N. 
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